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Abstract. We give a simple proof of the Birkhoff theorem about existence of a faithful repre¬ 
sentation for any finite-dimensional nilpotent Lie algebra of characteristic zero. 


1. Introduction 

The Ado theorem says that any finite-dimensional Lie algebra admits a faithful finite-dimen¬ 
sional representation. It was first proved by Ado in 1935 p\f1 using a Lie-group technique (and 
as such, was restricted to the fields of complex and real numbers). Since then a few different 
proofs were given, including purely algebraic ones and those valid in the positive characteristic 
(the latter is due to Iwasawa; cf. (Bo, §7, Exercice 5]). A relatively recent new proof, due to 
Neretin, is given in [N]. All the known proofs involve the universal enveloping algebra, an infinite¬ 
dimensional object, and nontrivial facts about it (notably, the Poincare-Birkhoff-Witt theorem). 
Moreover, the arguments are different in zero and positive characteristics. Somewhat unusually, 
the positive characteristic case of the theorem is much easier than the characteristic zero one, 
due to the possibility to employ finite-dimensional induced modules constructed with the aid of 
a reduced finite-dimensional version of the universal enveloping algebra (in characteristic zero, 
such modules would be infinite). Perhaps because of all this, the Ado theorem is sometimes 
referred - in writings, and also in talks and private conversations - as a “strange theorem” ([N]) 
“surprisingly tricky to prove” on 52 . 3 ]). 

Here we give an entirely different proof of the Ado theorem, basing on properties of free nilpotent 
Lie algebras, and on simple combinatorics related to the tensor product of representations. The 
proof is elementary and does not involve universal enveloping algebras (in fact, it does not involve 
associative algebras at all and is intrinsic to the category of finite-dimensional Lie algebras). 
Another interesting feature of the proof is that it employs induction on the dimension of the 
algebra - not “from below”, as in many existing proofs of the Ado theorem, but “from above”, 
descending from an algebra for which the Ado theorem is already established. 

The drawbacks of the proof are that it is valid for nilpotent algebras and in characteristic zero 
only (first established in full generality by Birkhoff in 1937 jBi], thus sometimes referred as “the 
Birkhoff theorem”). 

We present the proof in Section [2] as a series of (elementary) lemmas. The Ado theorem is one 
of the cornerstone results of today’s structure theory of Lie algebras, used in plenty of proofs and 
arguments. When pretending to give a new proof of such a basic result, one should be especially 
careful not to fall into the trap of circular arguments. That is why, even when using known 
and/or elementary results, we at least outline their proofs. We also carefully isolate places where 
we need assumptions such as nilpotency of the algebra and characteristic zero of the ground field. 
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Being different, our proof still shares with all the previous proofs some elementary (and seem¬ 
ingly unavoidable in this context) tricks and observations. In particular, as for a centerless Lie 
algebra the Ado theorem is trivial (the adjoint representation will do), we will concentrate on 
central elements, and take the direct sum of representations to assemble “local” Ado properties 
(nonvanishing on particular elements) into the “global” one (faithfulness). 

In Section [3] we speculate about possibility to extend this proof to the case of positive charac¬ 
teristic. 


2. The proof 

In what follows, the ground field K is arbitrary, and all algebras, modules and vector spaces 
are finite-dimensional, unless stated otherwise. 

A Lie algebra structure on the tensor product L ® A of a Lie algebra L and an associative 
commutative algebra A is defined by the obvious factor-wise multiplication: [x ® a, y <g) b\ = 
[x, y\ <£> ab for x,y G L,a,b G A (such Lie algebras are dubbed as current Lie algebras , the term 
coming from physics where they play a role). 

Lemma 2.1 (Embedding of graded algebras into a tensor product). An N-graded 
Lie algebra L is embedded into the Lie algebra L ® tK[t]/(t n ) for some n G N. 

(We denote by N the set of all positive integers, so the algebras we consider are positively- 
graded) . 

Proof. Let L = be the N-grading (necessarily finite, as L is finite-dimensional). The 

algebra L is embedded into L <g) tK[t\ via x ha x <g) t\ where x E L^, 1 < i < n — 1. Since 
[Li ® t\ Lj ® td] = 0 if i + j > n, this embedding factors through the ideal L <8) t n K{t\. □ 

Remark 2.2. The statement is obviously true for arbitrary, not necessary Lie, N-graded algebras. 

For the purpose of this note, a representation p of a Lie algebra L (or the corresponding L- 
module) will be called nilpotent, if p{x) is a nilpotent linear map for any x G L (of course, due to 
the Engel theorem, this is equivalent to existence of a positive integer n such that p{x i) • • • p{x n ) = 
0 for any x\, ..., x n G L, but we will not need that). 

Recall that given a representation p : L —> End(E) of a Lie algebra L, an 1-cocycle is a linear 
map <p : L -A V such that 

<p([x,y]) - p(x){<p{y)) + p(y)(<p(x)) = o 

for any x,y G L. The space of all such 1-cocycles is denoted by Z 1 (L, V), and they may be 
thought as derivations of L with values in the L-module V. 

Lemma 2.3 (A nondegenerate cocycle implies Ado). Let L be a Lie algebra, V an L- 
module (respectively, nilpotent L-module), and p is 1-cocycle in Z 1 (L,V) such that Ker<^ = 0. 
Then L has a faithful representation (respectively, faithful nilpotent representation). 

Proof. The required representation p is given by an action of L on E © Z 1 (L, V) (direct sum of 
vector spaces), defined naturally on the first direct summand, and via p(x)('if) = ^(x) for x G L 
and if G Z 1 (L, V), on the second direct summand. □ 

Remark 2.4. It is possible to extend this statement, via induction, to higher-order cocycles, but 
we will not need this (but see Remark 12 .1 21 below). 

Lemma 2.5 (Ado for graded algebras). An N -graded Lie algebra over a field of charac¬ 
teristic zero has a faithful nilpotent representation. 

Proof. Let L be such Lie algebra. By Lemma 12.11 L is embedded into L <8> tK[t]/(t n ). The 
derivation (i.e., 1-cocyclc with values in the adjoint module) idof L ® tK[t]/(t n ) acts on a 
nonzero element x % ® f non-trivially, hence has zero kernel. By Lemma [2.31 L <gi tK[t]/ ( t n ) 

has a faithful nilpotent representation. Hence so does its subalgebra L. □ 
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Lemma 2.6 (Ado for free nilpotent algebras). A free nilpotent Lie algebra of finite rank 
over a field of characteristic zero has a faithful nilpotent representation. 

Proof. A free nilpotent Lie algebra of finite rank is finite-dimensional and N-graded. Apply 
Lemma 12.51 □ 

Lemma 2.7 (Local Ado implies global AdcQ). Let L be a Lie algebra such that for any 
nonzero x £ L there is a nilpotent representation p x of L such that p x (x) 0. Then L has a 
faithful nilpotent representation. 

Proof. Pick a nonzero element X\ £ L , and set f)\ = p Xl . If p± is faithful, we are done, if not, 
there is a nonzero X 2 £ L such that pi(x 2 ) = 0. Set p 2 = p\ © p X2 ■ Note that 

Ker p 2 = Ker p x fl Ker p X2 C Ker p\ , 

and the inclusion is strict, since X 2 does not belong to the left-hand side, but belongs to the 
right-hand side. Repeating this process, we get a series of representations pi, p 2 ,..., with strictly 
decreasing kernels. Since L is finite-dimensional, this process will terminate in a finite number of 
steps on a faithful representation p. Being the direct sum of nilpotent representations, p is also 
nilpotent. □ 

Lemma 2.8 (Abundance of ideals of codimension 1). Let I be a nonzero ideal of a 
nilpotent Lie algebra L. Then I contains an ideal J of L of codimension 1 in I such that [L , I] C J . 

Proof. This elementary result follows almost immediately from the definition of a nilpotent Lie 
algebra by a rudimentary Jordan-Holder-like argument. Namely, there is a chain of ideals of L 

0 = J 0 C A C • • • C / n _i C I n = L 

such that dim/,//,;_! = 1 and [L, I,] C / i _ 1 for any i = 1,2,... ,n (cf., e.g., [Bol §4, Proposition 
1]). The set {i | / C /,} contains n (and hence is nonempty), and does not contain 0 (since / is 
nonzero). Let k be the minimal element in this set. Then 

0 < dim 1/(1 n 4_i) = dim(/ + I k -i)/h-i < dim I k /1k-\ = 1, 

and hence the ideal I fl I k -i of L is of codimension 1 in I . We also have 

[L , I] c / n [ L , i k \ c / n ik-i, 

as required. □ 

Lemma 2.9 (Factorization of linear maps). Let V be a vector space, f,g linear maps in 
End(K), and Ker / C Kerp. Then there is a linear map h in End(K) such that g = ho f. 

Proof. This is an elementary linear algebra (cf., e.g., [ KM.. Proposition 6.8]). Fix a basis e ±,..., e n 
in V, and define a linear map h : Imf —> V by sending /(e*) to g^efi, i = 1,... ,n. This map is 
well-defined, since if JA Ai/( e *) — Si Tif( e i) f° r some A,, pi £ K, then Si(-^* — F*) e * ^ cs i n Ker / 
and hence in Kerp, and, consequently, Si ^*/( e *)) = Si F*/( e *))• Extend h to the whole 
V arbitrarily, say, by mapping a subspace complementary to Im / to zero. □ 

The next lemma contains the core arguments, of combinatorial character. 

Lemma 2.10 (Distinguishing elements by representation kernels). Let L be a Lie 
algebra over a field of characteristic zero, having a faithful nilpotent representation. Then for any 
two linearly independent elements x,y £ L, there is a nilpotent representation p of L such that 
Ker pfx) (jL Kerp(p). 


1 A reader of an earlier draft of this note remarked that after the present array of lemmas, one might expect 
the next one to be called Much Ado about nothing. The lemmas are elementary indeed. 
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Proof. Suppose the contrary: there are two linearly independent elements x,y G L such that for 
any nilpotent representation p of L , Kerp(x) C Ker p(y). By Lemma [2.91 

(1) p(y) = h p o p (x) 
for some linear map h p . 

Let p : L —* End(O) and r : L —* End(hL) be two nilpotent representations of L, and let 
n and m be indices of nilpotency of p(x) and r(x), respectively. The tensor product p <S> r is 
also nilpotent, so writing the condition ([Tj) for p <g) r and for vectors p(x) n_2 (n) <8> r(x) m ^ 1 (tc) 
and p(a;) n_1 (i;) <g) T(x) m ~ 2 (w), and taking into account the same condition for p and r, we get 
respectively 

h p (p(:r) n_1 (u)) <S> T(x) m ^ 1 (w) = h p ® T (p(a) n_1 (v) <g) r(t) m_1 (u;)) 

and 

P^)"- 1 ^) ® h T (r(x) ra -:V)) = h p ® T {p(x) n -\v) ® T(x) m -») 

for any v G Id, w G W. This implies that the linear maps ® id and id®/i T coincide on the 
vector space p(x) n ~ l (V) ® r(x) m_1 (hE), whence 

(2) fy>(p(20 n_1 (u)) = Ap(x) n_1 (u) 

and 

h T ^r(x) m_1 (in) j = Ar(a;) m_1 (in) 

for some A € K. Since this holds for any pair of representations p, r, (J2]) holds for any nilpotent 
representation p of L for some uniform value A G K. 

Further, writing the condition (jT]) for the tensor square p®r for vectors p(x) n ^ 3 (v)<^>T(x) m ~ 1 (w), 
p(x) n ^ 1 (v) ® r(x) m ~ 3 (w ), and p(x) n ~ 2 (v) ® r(x) m ^ 2 (w), and taking into account (J2D, we get 
respectively 

h p [p(x) n ~ 2 (v )) ® r(x) m_1 (M;) = h p(glT (p(x) n ~ 2 (v) ® , 

p(x) n_1 (n) ® h r ^r(x) m_2 (w)j = /i p0T (p(x) n_1 (v) ® T(x) m ~ 2 (w) S j , 

and 

(3) A^p(x) n_1 (n) ® r(x) m ” 2 (w) + p(x) r,_2 (u) ® r(x) m ~ 1 (w) j 

= W(p(®) n_1 (u) ® r(x) m - 2 (n;) + p(x)"- 2 (u) ® 

Summing up the first two of these equalities, and subtracting the third one, we get that the linear 
maps (h p — A id)<g)r(x) and p(x)®(h T — A id) coincide on the vector space p(x) n ~ 2 (V)®T(;x) rn ~ 2 (W ), 
whence h p — A id = —pp(x) and h T — A id = —pr(x) for some p G K as linear maps on p(x) n ~ 2 (V) 
and r(x) m ~ 2 (W), respectively. As this holds for any pair of nilpotent representations p, r, we get 
that 

(4) h p p(x) n ~ 2 (v )) = A p(x) n ~ 2 (v) + pp(x) n ~ 1 (v) 

for any nilpotent representation p of L, for some uniform value p G AT. 

On the other hand, the index of nilpotency of (p ® p)(x) is equal to 2n — 1, so writing (J4j) for 
the tensor square p ® p, and taking into account (j2J) in the case r = p (and m = n), we get 

(5) pp(x) n ~ l (v) ® p(a;) n_1 H = 0 
for any v,w G O, what implies p = 0, and, according to (J4j), 

^(p^)" - 2 ^)) — Ap(x) n “ 2 (n) 
for any nilpotent representation p of L. 
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Repeating this procedure (considering on each step the condition (jT]) for the tensor square of 
two representations p, r, and for all vectors of the form p(x) l (v) <g) T^xfitjw) with i + j equal to 
the index of nilpotency of p <S> r minus (k + 2)), we consecutively arrive at the equalities 

h P [p{x) k (v )) = Ap(x) fc (u) 

for any 1 < k < n. For k = 1 this means that h p = A id, and hence p(y — Xx) vanishes for any 
nilpotent representation p of L, whence y — Xx = 0, a contradiction. □ 

Finally, we can glue all of this together: 

Theorem 2.11 (Ado for nilpotent algebras). A nilpotent Lie algebra over a field of 
characteristic zero has a faithful nilpotent representation. 

Proof. Present a nilpotent Lie algebra L as a quotient of a free nilpotent Lie algebra F of a finite 
rank: L = F/I. We will proceed by induction on the dimension of /. The case I = 0 is covered 
by Lemma [2.61 

Suppose that / is nonzero. By Lemma [77B1 there is an ideal J of F such that J C /, dim I/ J = 1, 
and [F, I] C J. Consequently, S = F/ J is an extension of L by an one-dimensional central ideal, 
say Kz. 

Take an arbitrary nonzero x G L and consider its preimage x in S. By the induction assumption, 
S has a faithful nilpotent representation, and by Lemma 12.101 there is a nilpotent representation 
p : S —* End(R) such that 

(6) Ker p(z) (jL Kerp(x). 

Since 5 lies in the center of S, p(z) commutes with all maps from p(S'), and hence the space 
Kerp(5) is an S'-submodule of V, which also carries a natural structure of a (nilpotent) L- 
module, on which x acts nontrivially, due to ([6]). By Lemma 12.71 L has a faithful nilpotent 
representation. □ 

Remark 2.12. In the proof above, we may try to take a dual route, utilizing images instead 
of kernels. Namely, for any representation p : S —> End(E), the space V/Im p(z) also carries 
a natural structure of an L- module. If Imp(x) (jL Imp(5), then x acts on the latter module 
nontrivially, and this action is nilpotent if p is nilpotent, so we may assume that Im p[x) C Im p[z) 
for any nilpotent representation p of S. By the statement dual to Lemma 12.91 the latter condition 
may be rewritten as a dual one to ©: 

p(x) = p(S) o h p 

for some linear map h p : V —> V. Additionally, we may be helped by the fact that the element x 
in these considerations may assumed to be central. Indeed, write the central extension S of L as 
the vector space direct sum S = L © Kz , with multiplication 

{u, v} = [u, v\ + <p(u, v)z 

for u,v G L, where [ •, • ] is multiplication in L, and <p is a 2-cocycle on L with values in K. Then 
we may apply a “local” version of Lemma [2731 for 2-cocycles to deduce that <p(x, L) = 0, i.e. x is 
central in S. But even with this additional input, to prove the appropriate dual version of Lemma 
12.101 seems to be more tricky, and requires an extensive consideration of associative envelopes of 
p(Sy s in the appropriate matrix algebra. 

3. An alternative route 

In the proof of the Ado theorem for nilpotent algebras in Section [2l the characteristic zero of 
the ground field K is needed in two places: first, in the proof of Lemma [2.51 to ensure that there 
is a derivation D of tK[t\/(t n ) such that 5A >:1 x { <g> D(t l ) 0 (what is wrong if the characteristic 
of K is p > 0, and all the exponents i in nonzero terms of JA>i x i ® t l divide p); and second, in 
the proof of Lemma [2. 101 to ensure that the binomial coefficients arising in binomial-like formulas 
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for powers of Lie algebra elements actions on tensor products of representations, like those in (15|), 
do not vanish. 

Here we outline a possible alternative approach which should include the case of positive char¬ 
acteristic. In Section [2] we apply an elementary embedding of Lemma 12.11 to free nilpotent Lie 
algebras of finite rank. A little bit more involved argument establishes a similar embedding for 
a broader class algebras. This argument is sometimes phrased in terms of “generic elements” or 
“generic matrices” and is often employed in the theory of varieties of Lie, associative, and other 
kinds of algebras. Namely, a relatively free algebra F of no more then countable rank in a variety 
generated by a finite-dimensional Lie algebra L is embedded into 

L <g) (tiK[ti ,..., t k \ H-b t k K[t i,..., 4]) 

for some k G N (cf., e.g., (Rj Chapter I, Lemma 5.1] or [ Zai Proof of Proposition 1.3]). In the 
case of nilpotent L one can do even better and embed F into L (g) tK[t\ (cf. [Zaj, Lemma 1.1]). 
Also, in the case of nilpotent L this embedding obviously factors through the ideal L <E> t n K[t], 
where n is the index of nilpotency of L , similarly as in Lemma 12.11 

(Note parenthetically that this or similar arguments are often coupled with the Ado theorem to 
establish an embedding of a relatively free algebra in some variety of Lie algebras into an algebra 
with that or another finiteness condition (cf., e.g., [Za]). Here we reverse this line of reasonings 
and use this argument to outline a possible route to the Ado theorem). 

As the adjoint representation is nonzero on non-central elements, in view of Lemma 12. 71 in order 
to establish the Ado theorem for F, it will be enough to prove that for any nonzero central element 
of F, there is a representation not vanishing on that element. By an easy inductive argument, 
one may reduce considerations to the case where the center Z(L) of L is one-dimensional, so to 
cover the characteristic p case in the proof of Lemma 12.51 it will be enough to consider elements 
of F of the form 

z ® (Ai t p + A 2 t 2p + ...), 

where z G Z (L) and A, G K , which probably could be dealt with using additional considerations 
based on relative freeness of F. 

Additionally, one may try to employ derivations of L®tK[t]/(t n ) of the form other than icR 
where D is a derivation of tK[t\/(t n ). As explained in [ZuJ §3], the full description of derivations 
of such current Lie algebras is, probably, a difficult task, but one may try, for example, to employ 
derivations of the forms listed in |Zul Theorem 3]. 

After establishing the Ado theorem for any relatively free algebra F in a variety generated by 
a nilpotent Lie algebra L, we may proceed the same way as in the proof of Theorem in Section [21 
by induction on the dimension of ideal of relations determining L. To establish Lemma 12.101 in 
characteristic p, an additional care will be needed to deal with vanishing of binomial coefficients 
occuring in the proof. 

All this, however, will result in a quite long and involved proof - at least much more long and 
involved than the existing proof of the full-fledged Ado theorem in positive characteristic - so we 
will not pursue this approach. 

A final remark: in all existing proofs of the Ado theorem in characteristic zero, the general 
case is derived from the nilpotent one. In this respect, §2] is a good start. However, all such 
derivations employ the universal enveloping algebra in a non-trivial way. A short, “natural”, and 
characteristic-free proof of the Ado theorem is yet to be found. 
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